Sample-Efficient Training of Robotic Guide Using Human Path Prediction Network
نویسندگان
چکیده
Training a robot that engages with people is challenging; it expensive to directly involve in the training process, which requires numerous data samples. This paper presents an alternative approach for resolving this problem. We propose human path prediction network (HPPN) generates user's future trajectory based on sequential actions and responses using recurrent-neural-network structure. Subsequently, evolution-strategy-based method only virtual movements generated HPPN presented. It demonstrated our proposed permits sample-efficient of robotic guide visually impaired people. By collecting 1.5 K episodes from real users, we were able train generate more than 100 required robot. The trained precisely guided blindfolded participants along target path. Furthermore, episodes, investigated new reward design prioritizes comfort during robot's guidance without incurring additional costs. expected be widely applicable robots interact physically humans.
منابع مشابه
assessment of the efficiency of s.p.g.c refineries using network dea
data envelopment analysis (dea) is a powerful tool for measuring relative efficiency of organizational units referred to as decision making units (dmus). in most cases dmus have network structures with internal linking activities. traditional dea models, however, consider dmus as black boxes with no regard to their linking activities and therefore do not provide decision makers with the reasons...
An Efficient Architecture for Robotic Path Planning
There are many path planning algorithms designed for mobile robots with software implementation. In the case of dynamic environments high-speed planning and recomputation of paths is necessary to avoid collision of robots with moving objects. A hardware-efficient algorithm is presented for finding a path of a mobile robot on image of an environment captured by an overhead camera. The algorithm ...
متن کاملSample Efficient Path Integral Control under Uncertainty
We present a data-driven stochastic optimal control framework that is derived using the path integral (PI) control approach. We find iterative control laws analytically without a priori policy parameterization based on probabilistic representation of the learned dynamics model. The proposed algorithm operates in a forward-backward sweep manner which differentiate it from other PI-related method...
متن کاملPrediction of human microRNA hairpins using only positive sample learning
MicroRNAs (miRNAs) are small molecular non-coding RNAs that have important roles in the post-transcriptional mechanism of animals and plants. They are commonly 21-25 nucleotides (nt) long and derived from 60-90 nt RNA hairpin structures, called miRNA hairpins. A larger number of sequence segments in the human genome have been computationally identified with such 60-90 nt hairpins, however the m...
متن کاملEfficient Path Kernels for Reaction Function Prediction
Kernels for structured data are rapidly becoming an essential part of the machine learning toolbox. Graph kernels provide similarity measures for complex relational objects, such as molecules and enzymes. Graph kernels based on walks are popular due their fast computation but their predictive performance is often not satisfactory, while kernels based on subgraphs suffer from high computational ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: IEEE Access
سال: 2022
ISSN: ['2169-3536']
DOI: https://doi.org/10.1109/access.2022.3210932